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(54) Digital data watermarking 

(57) Digital watermarking of data, including image, 
video and audio data, is performed by repeatedly insert- 
ing the watermark into subregions or subimages ol the 
data. Similarly, the watermark is repeatedly extracted 
from the subregions of the data. 
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Description 

The present invention relates to digital watermark- 
ing of data including image, video and multimedia data. 
Specifically, the invention relates to the insertion and 5 
extraction of embedded signals for purposes of water- 
marking, in which the insertion and extraction proce- 
dures are repeatedly applied to subregions of the data. 
When these subregions correspond to the 8x8 pixel 
blocks used for MPEG and JPEG compression and w 
decompression, the watermarking procedure can be 
tightly coupled with these compression algorithms to 
achieve very significant savings in computation. 

The proliferation of digitized media such as image, 
video and multimedia is creating a need for a security 15 
system which facilitates the identification of the source 
of the material. 

Content providers, i.e. owners of works in digital 
data form, have a need to embed signals into 
video/image/multimedia data which can subsequently so 
be detected by software and or hardware devices for 
purposes of authenticating copyright ownership, control 
and management. 

For example, a coded signal might be inserted in 
data to indicate that the data should not be copied. The 25 
embedded signal should preserve the image fidelity, be 
robust to common signal transformations and resistant 
to tampering. In addition, consideration must be given to 
the data rate that can be provided by the system, 
though current requirements are relatively low - a few 30 
bits per frame. 

In EP-A-766 468 corresponding to U.S. Patent 
Application Serial No. 08/534,894, filed September 28, 
1995, entitled "Secure Spread Spectrum Watermarking 
for Multimedia Data" and assigned to the same 35 
assignee as tie present invention, which is incorporated 
herein by reference, there was proposed a spread spec- 
trum watermarking method which embedded a water- 
mark signal into perceptually significant regions of an 
image for the purposes of identifying the content owner 40 
and/or possessor. A strength of this approach is that the 
watermark is very difficult to remove. In tact this method 
only allows the watermark to be read if the original 
image or data is available for comparison. This is 
because the original spectrum of the watermark is 45 
shaped to that of the image through a non-linear multi- 
plicative procedure and this spectral shaping must be 
removed prior to detection by matched filtering and the 
watermark is inserted into the N largest spectral coeffi- 
cients, the ranking of which is not preserved after water- so 
marking. Thus, this method does not allow software and 
hardware devices to directly read embedded signals. 

In an article by Cox et al. ( entitled " Secured Spec- 
trum Watermarking for Multimedia" available at 
httpV/www.neci. nj.com/tr/irxiex. html (Technical Report ss 
No. 95-10) spread spectrum watermarking is described 
which embeds a pseud o- random noise sequence into 
the digital data for watermarking purposes. 



The above prior art watermark extraction methodol- 
ogy requires the original image spectrum be subtracted 
from the watermark image spectrum. This restricts the 
use of the method when there is no original image or 
original image spectrum available. One application 
where this presents a significant difficulty is for third 
party device providers desiring to read embedded infor- 
mation for operation or denying operation of such a 
device. 

In US Patent No. 5,319,735 by R.D. Preuss et al 
entitled "Embedded Signalling" digital information is 
encoded to produce a sequence of code symbols. The 
sequence of code symbols is embedded in an audio sig- 
nal by generating a corresponding sequence of spread 
spectrum code signals representing the sequence of 
code symbols. The frequency components of the code 
signal being essentially confined to a preselected sign- 
aling band lying within the bandwidth of the audio signal 
and successive segments of the code signal corre- 
sponds to successive code symbols in the sequence. 
The audio signal is continuously frequency analyzed 
over a frequency band encompassing the signalling 
band and the code signal is dynamically filtered as a 
function of the analysis to provide a modified code sig- 
nal with frequency component levels which are, at each 
time instant, essentially a preselected proportion of the 
levels of the audio signal frequency components in cor- 
responding frequency ranges. The modified code signal 
and the audio signal are combined to provide a compos- 
ite audio signal in which the digital information is 
embedded. This component audio signal is then 
recorded on a recording medium or is otherwise sub- 
jected to a transmission channel. Two key elements of 
this process are the spectral shaping and spectral 
equalization that occur at the insertion and extraction 
stages, respectively, thereby allowing the embedded 
signal to be extracted without access to the unwater- 
marked original data. 

In EP-patent application No. 97 11 53 53.1 corre- 
sponding to U.S. Patent Application Serial No. 
08/708,331. filed Septembers 1996, entitled "A Spread 
Spectrum Watermark for Embedded Signaling" by Cox, 
and incorporated herein by reference, there is 
described a method for extracting a watermark of 
embedded data from watermarked images or video 
without using an original or unwatermarked version of 
the data. This work can be viewed as an extension of 
the original work of Preuss et al from the audio domain 
to images and video. 

This method of watermarking an image or image 
data for embedding signaling requires that the DCT 
(discrete cosine transform) and its inverse of the entire 
image be computed. There are fast algorithms for com- 
puting the DCT in N log N time, where N is the number 
of pixels in the image. However, for A/=512x512. the 
computational requirement is still high, particularly if the 
encoding and extracting processes must occur at video 
rates, i.e. 30 frames per second. This method requires 
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approximately 30 times the computation needed for 
MPEG-II decompression. 

One possible way to achieve real-time video water- 
marking is to only watermark every N m frame. However, 
content owners wish to protect each and every video 
frame. Moreover, if it is known which frames contain 
embedded signals, it is simple to remove those frames 
with no noticeable degradation in the video signal 

In U.S. Patent Application Serial No. 08/715.953, 
filed September 19, 1996, entitled "Watermarking of 
Image Data Using MPEG/SPEG Coefficients" by Cox, 
and incorporated herein by reference, there is 
described an alternative method, which is to insert the 
watermark into n x n blocks of the image (subimages) 
where n«N Then the computation cost is 



^ n log n = N log n 



For N=512x512=2 18 and n=Sx8=2 6 1 the asymp- 
totic saving is only a factor of 3. However, empirically the 
cost of computing the DCT over the entire image may 
be significantly higher when cache, loop unfolding and 
other efficiency issues are considered. Thus, the practi- 
cal difference may approach a 30 fold savings. More 
importantly if the block size is chosen to be 8x8. i.e. the 
same size as that used for MPEG image compression, 
then it is possble to tightly couple the watermark inser- 
tion and extraction procedures to those of the MPEG 
compression and decompression algorithms. Consider- 
able computational saving can then be achieved since 
the most expenses computations relate to the calcula- 
tion of the DCT and its inverse and these steps are 
already computed as part of the compression and 
decompression algorithm. The incremental cost of 
watermarking is then very small, typically less than 5% 
of the computational requirements associated with 
MPEG. 

The present invention improves the reliability of the 
invention described in the 08/715,953 application by 
storing watermark information into subimages, and 
extracting watermark information from subimages, in a 
manner different from that described earlier. 

The present invention improves the reliability of the 
prior systems by systematically varying the order in 
which watermark signal components are inserted into 
each subimage, by inserting only part of the watermark 
signal into each subimage, and. during watermark 
detection, by combining the watermark signals found in 
groups of subimages to reconstruct the original water- 
mark signal before testing for correlation with any pre- 
defined watermarks 

For detection, a reverse transformation is applied to 
each subimage to reconstruct the watermark informa- 
tion that was stored in that subimage. The resulting sig- 
nals are then averaged together to reconstruct the 
whole watermark, and to reduce noise. Finally, this 



reconstructed watermark is compared against a prede- 
fined set of watermark signals to determine which one 
was inserted into the image. 

A principal object of the present invention is there- 
5 fore, the provision of inserting a subset of a watermark 
into a subset of subregions of data to be watermarked. 

Another object of the invention is the provision of a 
digital watermarking system in which a watermark is 
extracted by averaging the watermarked signal from 
jo subregions of watermarked data, and then correlating 
the resulting signal to determine the watermark. 

A further object of the invention is the provision of a 
digital watermarking system in which the watermark is 
composed of two portions, a verification portion and a 
is synchronization portion, in order to improve watermark 
extraction reliability 

Further and still other objects of the invention will 
become more clearly apparent when the following 
description is read in conjunction with the accompany- 
20 ing drawing. 

BriFif Description of the Drawings 

Figure 1 is a schematic block diagram of watermark 
25 insertion procedure; 

Figure 2 is a schematic block diagram of a water- 
mark insertion procedure in accordance with the 
teachings of the present invention; 
Figure 3 is a schematic block diagram of a water- 
30 mark extraction procedure; 

Figure 4 is a schematic block diagram of a water- 
mark extraction procedure in accordance with the 
teachings of the present invention; 
Figure 5 is a graphic representation of a zigzag pat- 
35 tern useful for vectorizing subimages; 

Figure 6 is a graphic representation of rotation of 
PN sequences; 

Figure 7 is a graphical representation of an 8x8 
block shown the spatial relation of averaged terms; 
40 Figure 8 is a schematic block diagram of a method 
for inserting watermarks in accordance with the 
present invention; and 

Figure 9 is a schematic block diagram of a method 
for extracting watermarks in accordance with the 
45 present invention. 



nailed Description 

Referring now to the figures, and Figures 1 through 
so 4 in particular, there is shown schematic block diagrams 
of a general method for inserting and detecting water- 
marks in digital data, for instance images. 

In the following description, reference may be made 
to image data or images. While the invention has appli- 
55 cability to image data and images, it will be understood 
that the teachings herein and the invention itself are 
equally applicable to video, image and multimedia data 
and the term "image" and "image data" will be under- 
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stood to include these terms where applicable. As used 
herein, \vatermark" will be understood to include 
embedded data, symbols, images, instructions or any 
other identifying information. 

In the following description, reference is made to 5 
procedures described in U.S. Patent Application Serial 
No. 08/534,894 for inserting and extracting or detecting 
a watermark in images as INSERT-ORIGINAL and 
EXTRACT-ORIGINAL, respectively. Reference is made 
to procedures described in U.S. Patent Application w 
Serial No. 08/708,331 filed September 4 t 1996 for 
inserting and extracting or detecting watermarks in 
images as INSERT-WHOLE and EXTRACT- WHOLE, 
respectively. And reference is made to procedures 
described in U.S. Patent Application Serial No. 15 
08/715,953 for inserting and extracting or detecting 
watermarks in images as INSERT-MPEG- A and 
EXTRACT-MPEG-A, respectively. 

Figure 1 shows a schematic block diagram of 
INSERT- WHOLE procedure for inserting watermarks 20 
into images. The watermark signal, in the form of a finite 
sequence of symbols chosen from an alphabet, is pro- 
vided as an input to an error correction encoder 10 
which transforms this sequence into another sequence 
that contains redundant information. The output of 25 
encoder 1 0 is provided to a PN-mapper 1 1 . which maps 
each symbol of the encoded watermark into a pre-spec- 
rfied pseudo-random noise (PN) code. The output of the 
PN-mapper 1 1 is provided to a spectral transformer 12, 
which converts the pseudo-random noise sequence into 30 
the frequency domain. The conversion preferably is by 
discrete cosine transform (DCT), however, fast fourier 
transform, wavelet type decomposition and the like may 
also be used for frequency conversion. Concurrently, 
the data to be watermarked is provided to another spec- 35 
tral transformer 13. The outputs of the two spectral 
transformers 12 and 13 are then provided as inputs to a 
spectral shaper 14, which modifies the spectral proper- 
ties of the pseudo-random noise codes from spectral 
transformer 12 to mask the watermark when added to 40 
the image data The spectrally transformed data to be 
watermarked, from spectral transformer 13, is also pro- 
vided as an input to a delay 1 5. The output of the spec- 
tral shaper 1 4 is then added to the output of delay 1 5 at 
a summer 16. The summer output is subject to an 45 
inverse transform 1 7. The result of the inverse transform 
is watermarked data. 

INSERT-MPEG- A differs from INSERT-WHOLE by 
segmenting the data to be watermarked into multiple 
blocks, such as 8x8 pixel subimages or subregions. so 
Each block of data then has the watermark inserted 
according to the above described method. That is. for 
each 8x8 subimage or subregion. a pseudo-random 
number (PN) sequence is inserted into the DCT coeffi- 
cients after suitable spectral shaping. The procedure is 55 
repeated for all such subimages or subregions. The size 
of the subimage or subregion is preferably 8x8, but it 
can be of other sizes, such as 2x2, 3x3. 4x4 or 16x16. 



Figure 2 shows a schematic block diagram of a 
watermark insertion procedure in accordance with 
teachings of the present invention. The watermark sig- 
nal is processed into a noise spectrum signal by the 
error correction encoder 20, the PN mapper 21 , and the 
spectral transformer 22, in the same manner as 
described in conjunction with Figure 1. However, unlike 
INSERT-WHOLE or INSERT-MPEG-A, the watermark is 
then used as an input to a watermark segmenter 23, 
which systematically separates the watermark into sev- 
eral subwatermarks. Any portion of the original water- 
mark might appear redundantly in several of the 
resulting subwatermarks. Concurrently, the data to be 
watermarked is used as an input to data segmenter 24, 
which segments the data into blocks or subregions, 
such as 8x8 subimages, as in INSERT-MPEG-A. Each 
of the subwatermarks output by the watermark seg- 
menter 23 is then inserted into a data block by one of 
the watermark inserters 25a. 25b, etc. The procedure 
used by the watermark inserters 25a, 25b, etc., is the 
same procedure described connection with watermark 
inserter 18 in Figure 1. That is, each subwatermark is 
added into a spectrally transformed data block after 
spectral shaping, and the resulting data is then trans- 
formed back into the spatial domain. Finally, the water- 
marked data blocks from the watermark inserters 25a, 
25b, etc., are assembled by data combiner 26 to pro- 
duce watermarked data 

Figure 3 shows a schematic block diagram of the 
EXTRACT-WHOLE procedure. The watermarked 
image, video or multimedia data is first used as input 
into a spectral normalizer 30 to undo any previously per- 
formed spectral shaping. If the data contains a water- 
mark, then the output of the spectral normalizer 30 will 
resemble the spectral transformation of the PN coding 
of that watermark (the signal that was input to the spec- 
tral shaper 14 in Figure 1). The output of the spectral 
normalizer 30 is then used as an input to several corre- 
lators 3la, 31b, etc., which test the watermark with the 
PN codes used to represent the various symbols that 
the encoded watermark might contain (i.e. each correla- 
tor tests for one PN code that is used to encode a sym- 
bol by the PN mapper 1 1 of Figure 1). The outputs of the 
correlators 31a. 31b, etc., are used as inputs to a deci- 
sion circuit 32, which determines the most likely 
sequence of symbols. Finally, this sequence is cor- 
rected by an, error corrector 33, which performs the 
inverse coding that was performed by the error correc- 
tion encoder 10 in Figure 1. The result is the extracted 
watermark. 

In EXTRACT-MPEG-A, the data from which a 
watermark is to be extracted is first segmented into sev- 
eral blocks, such as 8x8 subimages, exactly as in 
INSERT-MPEG-A. The signal from each subimage is 
then normalized and used as input into a bank of corre- 
lators similar to the correlators 31a. 31b, etc. in Figure 
3. The output from the correlators is then averaged with 
the outputs of corresponding correlators from other 
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subimages, and the resulting average correlations are 
used as inputs into the decision circuit 32 for subse- 
quent processing as described above. 

Figure 4 shows a schematic block diagram of a 
watermark extraction procedure in accordance with the 
teachings of the present invention. The watermarked 
data is first segmented into blocks by data segmenter 
40, which corresponds to the data segmenter 24 used 
during the insertion procedure in Figure 2. Each of the 
data blocks is provided to a respective spectrum nor- 
malizer 41a, 41b, etc. to produce a signal resembling 
the subwatermark that was inserted into the respective 
data block. These inserted subwatermark signals are 
then used as inputs into a watermark combiner 42. In 
the combiner 42, parts of the watermark that appear 
redundantly in several subwatermarks are averaged 
together to reduce noise. The output of the watermark 
combiner 42 is provided as the input to a symbol sepa- 
rator 43 which divides the watermark into parts, each of 
which corresponds to one symbol from the encoded 
watermark signal (the output of error correction encoder 
20 in Figure 2). 

These symbols from separator 43 are provided as 
inputs to respective watermark identifiers 44a. 44b etc. 
each of which includes of a bank of correlators and a 
decision circuit, as shown in Figure 3. The outputs of the 
watermark identifiers are symbols from the alphabet 
used in the original encoded watermark signal. The 
identified symbols are reassembled into a complete 
encoded watermark by the symbol combiner 45. Finally, 
the resulting encoded watermark is decoded by the 
error corrector 46. 

The insertion and extraction procedures will now be 
described in more detail. In INSERT-ORIGINAL and 
EXTRACT-ORIGINAL, the object is to embed a single 
PN (pseudo random number) sequence into an image 
when the original image is available at the time of 
extraction. The information associated with the PN 
sequence is assumed to be stored in a database 
together with the original image and the spectral loca- 
tion of the embedded watermark. The locations of the 
watermarked components has to be recorded because 
the implementation approximated the N perceptually 
most significant regions of the watermark by the N larg- 
est coefficients. However, this ranking was not invariant 
to the watermarking process. The N largest coefficients 
may be different after inserting the watermark than 
before inserting the watermark. 

In order to avoid this problem, the present invention 
places a watermark in predetermined locations of the 
spectrum, typically the first N coeff icients. However, any 
predetermined locations could be used, though such 
locations should belong to the perceptually significant 
regions of the spectrum if the watermark is to survive 
common signals transformations such as compression, 
scaling, etc. 

More generally, the information to be embedded is 
a sequence of m symbols drawn from an alphabet A 



(e.g. the binary digits or the ASCII symbols). This data 
is then supplemented with additional symbols for error 
detection and correction. Each symbol is then spread 
spectrum modulated, a process that maps each symbol 

s into a unique PN sequence known as a chip. The 
number of bits per chip is preset - the longer the chip 
length, the higher the detected signal-to-noise ratio will 
be, but this is at the expense of signaling bandwidth. 
The power spectrum of the PN sequence is white, 

w i.e. flat, and is therefore shaped to match that of the 
"noise", i.e. the image/video/audio/or multimedia data 
into which the watermark is to be embedded. It is this 
spectral shaping that must be modified from the prior 
methods so that the extraction process no longer 

15 requires the original image. To do this, each coefficient 
of the watermarked spectrum is scaled by the local 
average of the power in the image spectral coefficient 
rather than the coefficient itself, i.e. 

20 / / = / / + aavg(K / |)lV / (1) 

The averaging is the averaging of the absolute coef- 
ficient values and not the coefficient values themselves. 
This is effectively estimating the average power present 

25 at each frequency. Other averaging procedures are pos- 
sible, for example, averaging over several frames or 
average of local neighborhoods of 8x8 blocks. 

This average may be obtained in several ways. It 
may be a local average over a two dimensional region. 

30 Alternatively, the two dimensional spectrum may be 
sampled to form a one dimensional vector and a one 
dimensional local average may be performed. One 
dimensional vectorization of the two dimensional 8x8 
DCT coefficients is already performed as part of MPEG 

35 ll.The average may be a simple box or weighted aver- 
age over the neighborhood. 

For video data, temporal averaging of the spectral 
coefficients over several frames can also be applied. 
However, since several frames are needed for averag- 

40 ing at the spectral normalization stage of the extractor, 
the protection of individual video frames taken in isola- 
tion may not be possible. For this reason, the present 
invention treats video as a very large collection of still 
images. In this way, even individual video frames are 

45 copy protected. 

In order to extract the watermark, it is necessary to 
perform the spectral normalization, in which the previ- 
ously performed spectral shaping procedure is inverted. 
In the present invention, the original unwatermarked 

so signal is not available. Thus, the average power of the 
frequency coefficients, aMg(|f,|),is approximated by the 
average of the watermarked signal, i.e. avp(|f*/|) 

avg{\f f \)* av9(\f f \) (2) 

55 

This is approximately true since aavg(\f\)W t «f h 
where Wj is the watermark component, and a is a con- 
stant typically in the range between 0.1 and 0.01 . 
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The normalization stage then divides each coeffi- 
cient (// ) in the received signal by the local average avg 
(Kil) in the neighborhood. 

That is, 

f, f l+ aavg(lf t \)W f /, 

The first term, on the right hand side (RHS) of 
Equation (3) , 

f. 



is considered a noise term. This term was not present in 
the system described U.S. Patent Application Serial No. 
08/534,894, because access to the unwatermarked 
coefficients allowed this term to be removed. The sec- 
ond term a W t is the original watermark signal which can 
now be detected using conventional correlation. 

If the watermark is extracted from any single 8x8 
block, the detector reliability is very low. If, however, the 
watermarks extracted from each 8x8 block are first 
added together and the averaged watermark is then 
applied to the correlator, then a very strong and unam- 
biguous response is obtained. This differs from the 
method described in U.S. Patent Application Serial No. 
08/715,953 in which correlation occurred within each 
block and the output from each correlator was averaged 
together. The present invention was found to improve 
the detection response and significantly reduced the 
computation requirement associated with each block. 

In practicing the present invention preferably there 
is a unique PN sequence for each symbol in the alpha- 
bet. The method is relatively robust to clipping since the 
detector output reduces linearly with the quantity of 8x8 
subimage blocks in the image. For DVD (digital video 
disk) embedded signaling for APS (analog protection 
system) and CGMS (copy generation management sys- 
tem), there would be a total of 8 or 1 6 PN sequences. 

The number of 8x8 blocks in a 512x512 image is 
4096, suggesting that significantly more than one of 16 
symbols can be embedded in an image or video frame. 
Assume, for example, that it is desired to embed 1 out of 
128 symbols in an image. It is necessary to perform 128 
parallel correlations. This is computationally tractable 
but hardware implementations of each correlation 
become more complex. An alternative method is to only 
use two binary symbols, ft may be preferable to associ- 
ate more than one PN sequence with each of the two 
binary symbols or bits in order to increase the diff icufty 
of intentionally removing the watermark. In this case, 
there are only two correlators and a binary string may 
be embedded into the image. The raw bit error rate will 
be very high due by the low detector output However, 



this can be reduced to acceptable levels by using error 
correcting codes, such as Reed-Solomon (RS). RS 
codes are robust to burst error which may occur 
because of clipping of the image. Other error correcting 
s codes may also be used. 

When using this method, it is necessary for the 
receiver to know the start location of the encoded block. 
The start location may not be obvious, particularly when 
the image has been subjected to clipping. However, 
io convention synchronizing methods can be used; such 
as preceding each block with a special or unique sym- 
bol or string of symbols. 

To insert a watermark, each 8x8 block is treated as 
an individual subimage or subregion. Tile DCT of the 
15 subimage is then computed and the two dimensional 
DCT is vectorized in the zigzag pattern shown in Figure 
5. although other patterns are also possible. These two 
stages constitute most of the calculations but are part of 
the MPEG encoding process. Next, a PN noise 
20 sequence {w, ...W n} is inserted into the DCT coefficients 
using Equation 1 as before. The length of the PN 
sequence cannot exceed 64 (in an 8 x 8 block) and is 
typically much shorter, in the range of 1 1 to 25. If only a 
single code is to be inserted into the image, then the 
25 same PN sequence is inserted into each of the 720 x 
480/64 - 5400 blocks. However, a variation may be per- 
formed at this point in the procedure. Within each row of 
blocks, the PN sequence is cyclically rotated by one fre- 
quency coefficient prior to insertion in the subsequent 
30 block. Similarly, the PN sequence is cyclically rotated by 
one frequency coefficient at the start of each new row. 
Figure 6 illustrates an order of rotations. 

The purpose of these rotations or shifts is to 
improve the response of the watermark extraction 
35 stage. Earlier experiments revealed that certain DCT 
coefficients were more difficult to estimate than others. 
The location at these coefficients varied from image to 
image. However, within an image, the coefficient could 
be consistently poor. Consequently, without shifting, 
40 one or more of the estimated watermark coefficients 
could be significantly degraded relative to the other 
watermark coefficients, thereby reducing the detector 
performance. Conversely, shifting significantly reduces 
the effect a poor DCT coefficients has on a single water- 
45 mark coefficient and the detector performance is mark- 
edly improved. Note that any cyclic pattern can be used. 

Further modifications are useful once rotation of the 
watermark has been introduced. First, the length of the 
watermark may now be significantly greater than 64. 
so Then, for each block only a small subset of the water- 
mark (say five) coefficients is inserted into the first five 
DCT coefficients (excluding the d.c. term). Because of 
the rotation, a different subset of the watermark is 
inserted into neighboring 8x8 blocks. Finally, having 
55 completed the watermark insertion, the MPEG encoder 
is able to proceed with the subsequent stages of com- 
pression. 

Note that the watermark may also be inserted after 
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the MPEG quantization stage to reduce distortion of the 
watermark. MPEG-2 performs a convenient one dimen- 
sion vectorization called "zigzagging", which allows a 
simple 3x1 box average to be performed on the coeffi- 
cients (excluding thed.c. term). 

In practice, performance was improved if the aver- 
aging is performed using the 2 four-connected coeffi- 
cients closest to the d.c. term, as illustrated in Figure 7, 
i.e. the two coefficients above and to the left. 

Watermark detection begins by first extracting the 
PN noise sequence from each 8x8 block using Equation 
1. For each block, the PN sequence is then cyclically 
shifted in the opposite direction by one frequency coef- 
ficients, and the average over ail the blocks is then com- 
puted. In practice, this process can be computed 
incrementally and does not require temporary storage 
of all the extracted watermarks. A weighted averaging 
can also be applied, where the weights are determined 
based on their susceptibility to common signal transfor- 
mations such as low pass filtering. Finally, the average 
watermark is compared with the original PN sequence 
via correlation. The reason for shifting the watermark in 
the column direction may now be apparent. If the image 
is clipped on an arbitrary block boundary, then the com- 
puted average watermark will simply be rotated by an 
amount that is a function of the relative location of the 
clipped portion of the image. Correlation can then be 
performed on all permutations (typically 1 1 to 25) of the 
watermark. The output from the correlator with the max- 
imum value is then used for decision purposes. The 
extraction stage is depicted in Figure 4. Taking the max- 
imum correlator output over all rotations of the water- 
mark can cause the decision circuitry to be noisy. To 
improve this, the watermark is broken into two pieces; a 
synchronisation portion is of length K and a verification 
portion is N-K. Then, when the watermark is extracted 
as before, correlation is first performed only on all rota- 
tions of the synchronization portion of this watermark. 
The maximum correlation output is noted, then tile veri- 
fication portion of the watermark is rotated by the corre- 
sponding amount and a second correlation is performed 
on the verification portions of the watermarks. This 
process significantly improves the overall reliability of 
the system. In the course of experimentation, it was 
noticed that some watermarks performed better than 
others on the same imagery. This was caused by varia- 
tion in the correlation statistics between the synchroni- 
zation and verification portions of the watermark. 
Ideally, the two portions should have very low correla- 
tions. However, in several cases where watermarks per- 
formed poorly, it was traced to unexpected correlations 
between the two portions. 

The present invention provides a modification to 
digital watermarking methods in which the original data 
is required for watermark extraction thereby enabling 
watermarking extraction in the absence of an unwater- 
marked or original data The present invention prefera- 
bly uses MPEG/JPEG coefficients. An image is divided 



into typically 8x8 block sub images or subregions and 
each subimage is processed and the results are com- 
bined to derive the extracted watermark. The result is 
extraction of the watermark with very high confidence. 

s While the above invention describes improvements 

to the prior-art INSERT-WHOLE, INSERT-MPEG- A, 
EXTRACT WHOLE, and EXTRACT-MPEG-A algo- 
rithms, it should be apparent to anyone skilled in the art 
that the same improvements may be applied to any 

10 algorithm for inserting and extracting watermarks in 
image data. This more general view of the present 
invention is shown in Figures 8 and 9. 

Figure 8 shows a schematic block diagram of the 
general method for inserting watermarks. This general 

is method makes use of a non -block-based watermark 
insertion algorithm, which shall be referred to hereafter 
as the "base insertion algorithm". The watermark 
encoder 80 converts the watermark into a form appro- 
priate for the base insertion algorithm. If the base inser- 

20 tion algorithm is that shown in Figure 1 , for example, 
then the watermark encoder 80 corresponds to the 
watermark encoder 19, which comprises the error cor- 
rection encoder 10, the PN mapper 11, and the spectral 
transformer 12. However, rf a different base insertion 

25 algorithm is to be used, then the watermark encoder 80 
may perform a different transformation of the water- 
mark. The encoded watermark signal from watermark 
encoder 80 is provided as an input to watermark seg- 
menter 81. which divides the watermark into a set of 

30 subwatermarks. Any portion of the original watermark 
might appear redundantly in several of the resulting 
subwatermarks. The data to be watermarked is pro- 
vided as an input to data segmenter 82, which divides 
the data into subregions. Each subwatermark is 

35 inserted into a respective data subregion by a water- 
mark inserter 83a, 83b, etc. The watermark inserters 
implement the base insertion algorithm, so. if the base 
insertion algorithm is that shown in Figure 1 , then each 
watermark inserter 83a, 83b, etc., corresponds to the 

40 watermark inserter 18, which comprises a spectral 
transformer 13, a spectral shaper 1 4, a delay 1 5, a sum- 
mer 16, and an inverse transform 17. However, if a dif- 
ferent base insertion algorithm is to be used, then the 
watermark inserters 83a, 83b, etc., may employ a differ- 
45 errt method of inserting subwatermarks into the subre- 
gions of the data to be watermarked. The outputs from 
the watermark inserters are assembled in data com- 
biner 84 to provide watermarked data. 

Figure 9 shows a schematic block diagram of the 
so corresponding general extraction algorithm. The algo- 
rithm makes use of a "base extraction" algorithm that 
corresponds to the base insertion algorithm used in 
inserting the watermark into the data to be watermarked 
(Figure 8). The algorithm in Figure 9 is substantially the 
55 same as the algorithm shown in Figure 4. except that in 
the general case, the spectrum normalizers 41a, etc. 
are replaced by watermark extractors 91 a, etc.. which 
implement the base extraction algorithm. That is. rf the 



7 



13 



EP 0 840 513 A2 



14 



base insertion algorithm used was the algorithm shown 
in Figure 1, then the watermark extractors 91a, etc., in 
Figure 9 will be the spectrum normalizers 41a, etc. in 
Figure 4. 

While there has been described and illustrated a 5 
system for inserting a watermark into and extracting a 
watermark from watermarked data without using an 
unwatermarked version of the data, it will be apparent to 
those skilled in the art that variations and modifications 
are possible without deviating from the broad principles w 
and teachings of the present invention which shall be 
limited solely by the scope of the claims appended 
hereto. 

Claims 15 

1 . A method for inserting a watermark signal into data 
to be watermarked comprising the steps of: 

dividing data to be watermarked into a plurality 20 
of subregions; 

computing frequency coefficients of the data to 
be watermarked in each subregion; 
spread spectrum modulating a watermark sig- 
nal to be inserted by mapping the watermark 25 
signal into a PN (pseudo-random noise) 
sequence; 

spectral shaping the PN sequence as a func- 
tion of the average power in each frequency 
coefficient of the data; and 30 
inserting each spectral shaped PN sequence 
into predetermined coefficients in the data in 
each subregion. 

2. A method for inserting a watermark signal into data 35 
to be watermarked as set forth in claim 1, where 
said frequency coefficients are DCT (discrete 
cosine transform) coefficients. 

3. A method for inserting a watermark signal into data 40 
to be watermarked as set forth in claim 1 or 2, 
where each subregion is a 8 x 8 block of pixels, 

4. A method for inserting a watermark signal into data 

to be watermarked as set forth in claim 3,where the 4s 
frequency coefficients of the watermark signal are 
rotated prior to inserting of each spectral shaped 
PN sequence into the subregion. 

5. A method for inserting a watermark signal into data so 
to be watermarked as set forth in claim 4, where 
only a subset of the watermark signal frequency 
coefficients is inserted into any one subregion. 

6. A method for inserting a watermark signal into data ss 
to be watermarked as set forth in claim 5 , where 
the watermark signal comprises a synchronization 
portion and a verification portion. 



7. A method for inserting a watermark signal into data 
to be watermarked as set forth in any of claims 1 to 
6, where said inserting is performed after the data 
undergoes MPEG quantization processing. 

8. A method for inserting a watermark signal into data 
to be watermarked as set forth in claims 6 or 7, 
where the synchronisation portion and the verifica- 
tion portion have very little correlation between 
each other. 

9. A method for inserting a watermark signal into data 
to be watermarked as set forth in any of claims 1 to 
8, 

where the spectral shaping as a function of the 
average power is typically 3 x 1 window of the 
coefficient obtained from the one-dimensional 
vectorization by zigzagging of two-dimension 
frequency coefficients. 

1 0. A method for inserting a watermark signal into data 
to be watermarked as set forth in any of claims 1 to 
8, where the spectral shaping is a function of the 
average power based on the two four-connected 
frequency coefficients closest to the DC term. 

11. A method of extracting a watermark from water- 
marked data comprising the steps of: 

receiving subregions of watermarked data; 
spectrum normalizing the watermarked data 
from each subregion to generate respective 
normalized signals; 

combining the respective normalized signals 
from each subregion to generate a single 
watermark; 

correlating the single watermark with predeter- 
mined PN (pseudo-random noise) sequences 
corresponding to predetermined symbols to 
provide correlated signals for each predeter- 
mined PN sequence in each subregion; 
deciding which correlated signal is most likely a 
current symbol; and extracting a sequence of 
most likely current symbols corresponding to 
the watermark. 

12. A method of extracting a watermark from water- 
marked data as set forth in claim 1 1 , where the sub- 
regions are 8 x 8 blocks used for MPEG encoding 
and decoding. 

13. A method of extracting a watermark from water- 
marked data as set forth in claim 12. where said 
combining the normalized signals from each subre- 
gion to generate a single watermark, including 
removing the relative rotation of the watermark 
between blocks. 
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14. A method of extracting a watermark from water- 
marked data as set forth in claim 1 3, further com- 
prising subsequently reconstructing the watermark 
from partial watermarks inserted into each block. 

15. A method of extracting a watermark from water- 
marked data as set forth in claim 1 4, further com- 
prising weighting the watermark coefficients based 
on their location within the frequency spectrum, 
where the weighting is a function of the susceptibil- 
ity of each frequency coefficient to common signal 
transformations. 

16. A method of extracting a watermark from water- 
marked data as set forth in claim 11, further com- 
prising com elating with all rotational shifts of the 
extracted waiermark and selecting the maximum 
value. 

17. A method of extracting a watermark from water- 
marked data as set forth in claim 1 1 . further com- 
prising con elating wrth all rotational shifts of a 
synchronization portion of a watermark to deter- 
mine a maximum value and subsequently rotating a 
verification portion ol the watermark by the same 
amount as the synchronization portion is rotated to 
obtain the maximum value prior to correlating 
between the verification portion and predetermined 
PN sequences 

18. A method of extracting a watermark from water- 
marked data comprising the steps of: 

receiving subregions of watermarked data; 
spectrum normalizing the watermarked data 
from each subregion to generate respective 
normalized signals; 

correlating the respective normalized signals 
with predetermined PN sequences corre- 
sponding to predetermined symbols to provide 
correlated signals for each predetermined PN 
sequence in each subregion; 
deciding which correlated signal is most likely a 
current symbol in each subregion for providing 
an extracted symbol stream; 
error correcting the extracted symbol stream; 
and 

extracting a sequence of most likely current 
symbols corresponding to the watermark. 

19. A method of extracting a watermark from water- 
marked data as set forth in claim 18. where said 
error correction is Reed Solomon error correction. 

20. A method for inserting a watermark signal into data 
to be watermarked comprising the steps of: 

dividing data to be watermarked into a plurality 



of subregions; 

dividing a watermark signal into a plurality of 
subwatermarks where portions of the water- 
mark are contained in more than one subwater- 
5 mark; and 

inserting said plurality of subwatermarks into 
said plurality of subregions. 

21 . A method for inserting a watermark signal into data 
w to be watermarked as set forth in claim 20, where 
each subwatermark is inserted into a respective 
subregion, so that each subregion contains at least 
one subwatermark. 

is 22. A method for extracting a watermark signal from 
watermarked data comprising the steps of: 

receiving a plurality of subregions of watermark 
data; 

20 extracting a subwatermark from each subre- 

gion of said plurality of subregions; and 
combining and averaging the subwatermarks 
extracted from all the subregions to obtain a 
signal commensurate with the watermark sig- 

25 nal. 

23. A method for extracting a watermark signal from 
watermarked data as set forth in claim 22, further 
comprising the steps of: 

30 

dividing the signal commensurate with the 
watermark signal into a plurality of symbol sig- 
nals; 

correlating each symbol signal with a set of 

35 predefined signals; 

determining which predefined signal best cor- 
responds to each symbol signal; and 
combining the best corresponding predeter- 
mined signals to generate the watermark sig- 

40 nal. 
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